WHAT IS CLAIMED IS: 



1 1. A method of predicting aggregate behavior of a population, the method

2 comprising: 

3 providing a modeling system configured to model aggregate behavior of a

4 population as a function of aggregate on-line interest data, the on-line interest data based on 

5 passive observation of on-line behavior of a subpopulation, wherein the on-line behavior is 

6 related to, but different than, the behavior to be modeled, and wherein the subpopulation 

7 comprises a subset of the population; 

8 inputting to the modeling system on-line interest data related to a subject; 

9 generating, with the modeling system, a prediction of aggregate behavior 
10 related to the subject.

1 2. The method of claim 1 wherein the modeling system is further

2 configured to model aggregate behavior of the population as a function of characteristics of

3 the subject to which the aggregate behavior is related, the method further comprising

4 inputting to the modeling system data related to characteristics of the subject.

1 3. The method of claim 1 further comprising training the modeling

2 system with a learning data set, the learning data set including:

3 on-line interest data related to another subject, the another subject related to

4 the subject; and 

5 actual aggregate behavior data relating to the another subject. 

1 4. The method of claim 1 wherein the on-line interest data includes on-

2 line usage data.

1 5. The method of claim 1 wherein the aggregate behavior to be modeled 

2 is aggregate economic activity. 

1 6. The method of claim 5 wherein the aggregate economic activity to be 

2 modeled is related to a product.

1 7. The method of claim 6 wherein the product is selected from the group 

2 consisting of a movie, a video tape, a CD, a DVD, a model of automobile, a book, a toy, an 

3 appliance, an electronic device, a pharmaceutical product, and a software product. 



1 8. The method of claim 5 wherein the aggregate economic activity to be 

2 modeled is related to a service. 

1 9. The method of claim 5 wherein the aggregate economic activity to be 

2 modeled is related to a financial security.

1 10. The method of claim 1 wherein the aggregate behavior to be modeled 

2 is an extent of a disease. 

1 11. A system for predicting aggregate behavior of a population, the system

2 comprising: 

3 a modeling system configured to model aggregate behavior of a population as

4 a function of aggregate on-line interest data, the on-line interest data based on passive

5 observation of on-line behavior of a subpopulation, wherein the on-line behavior is related to,

6 but different than, the behavior to be modeled, and wherein the subpopulation comprises a

7 subset of the population; and

8 a module for receiving on-line interest data related to a subject and providing

9 the on-line interest data to the modeling system;

10 wherein the modeling system generates a prediction of aggregate behavior

11 related to the subject using the on-line interest data.

1 12. The system of claim 11 wherein the modeling system is further

2 configured to model aggregate behavior of a population as a function of characteristics of the 

3 subject to which the aggregate behavior is related, the system further including a module for 

4 receiving data related to characteristics of the subject and providing the data related to 

5 characteristics of the subject to the modeling system. 

1 13. The system of claim 11 further including a training module that trains

2 the modeling system with a learning data set, wherein the learning data set includes: 

3 on-line interest data related to another subject, the another subject related to 

4 the subject; and 

5 actual aggregate behavior data relating to the another subject. 

1 14. A method of training a modeling system to predict aggregate behavior 

2 of a population, the method comprising:

3 providing a modeling system;

4 providing a learning data set including: 

5 actual aggregate behavior data related to a first subject; and 

6 aggregate on-line interest data related to the first subject, the on-line 

7 interest data based on passive observation of on-line behavior 

8 of a subpopulation, wherein the on-line behavior is related to, 

9 but different than, the actual behavior, and wherein the 

10 subpopulation comprises a subset of the population; 

11 training the modeling system with the learning data set to minimize the error

12 between a predicted aggregate behavior related to the first subject generated by the modeling

13 system and the actual aggregate behavior related to the first subject.

1 15. The method of claim 14 wherein the learning data set further includes:

2 actual aggregate behavior data related to a second subject related to the first

3 subject; and

4 aggregate on-line interest data related to the second subject, the on-line

5 interest data related to the second subject based on passive observation of on-line behavior of

6 the subpopulation, wherein the on-line behavior is related to, but different than, the actual

7 behavior;

8 wherein training the modeling system with the learning data set includes

9 minimizing the mean-square error between the predicted aggregate behavior related to the 

10 first subject generated by the modeling system and the actual aggregate behavior related to 

11 the first subject and between a predicted aggregate behavior related to the second subject

12 generated by the modeling system and the actual aggregate behavior related to the second 

13 subject. 
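
As an illustrative sketch only, and not the claimed implementation, the training step of claims 14 and 15 can be pictured with a one-input linear model fitted by ordinary least squares, which minimizes the mean-square error over the pooled learning data set. The linear form, the function names `fit_linear` and `mean_square_error`, and every figure below are assumptions made for illustration; the claims do not specify a model type.

```python
from statistics import fmean


def fit_linear(interest, actual):
    """Closed-form least-squares fit of actual ~ slope * interest + intercept.

    Ordinary least squares minimizes the mean-square error over the data set.
    """
    mean_x, mean_y = fmean(interest), fmean(actual)
    var_x = sum((x - mean_x) ** 2 for x in interest)
    if var_x == 0:
        raise ValueError("interest data must vary to fit a slope")
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(interest, actual))
    slope = cov_xy / var_x
    return slope, mean_y - slope * mean_x


def mean_square_error(params, interest, actual):
    """Mean of squared differences between predicted and actual behavior."""
    slope, intercept = params
    return fmean((slope * x + intercept - y) ** 2 for x, y in zip(interest, actual))


# Hypothetical learning data set: (on-line interest count, actual aggregate
# behavior) pairs for two related subjects. All figures are invented.
first_subject = [(100.0, 12.0), (200.0, 19.0)]
second_subject = [(150.0, 15.0), (300.0, 31.0)]

pooled = first_subject + second_subject
interest = [x for x, _ in pooled]
actual = [y for _, y in pooled]

params = fit_linear(interest, actual)
print("slope, intercept:", params)
print("mean-square error:", mean_square_error(params, interest, actual))
```

Pooling both subjects before fitting is one way to read "minimizing the mean-square error" over the first and second subjects together; other model families or optimizers would serve the same purpose.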

1 16. A method of predicting a measure of aggregate economic activity 

2 related to a product, the method comprising: 

3 providing a modeling system configured to model aggregate economic activity 

4 of a type of product as a function of aggregate on-line interest data related to products 

5 comprising the type, wherein the on-line interest data is based on passive observation of on- 

6 line behavior of a subpopulation, wherein the on-line behavior is related to, but different than, 

7 the economic activity to be modeled, and wherein the subpopulation comprises a subset of a 

8 population that engages in the economic activity to be modeled;

9 inputting to the modeling system on-line interest data related to a first product

10 comprising the type; and 

11 generating a prediction of the measure of aggregate economic activity related

12 to the first product with the modeling system. 

1 17. The method of claim 16 wherein the modeling system is further

2 configured to model aggregate economic activity of the type of product as a function of

3 characteristics of products comprising the type, the method further comprising inputting to

4 the modeling system data related to characteristics of the first product. 

1 18. The method of claim 17 further comprising training the modeling

2 system with a learning data set, the learning data set including:

3 on-line interest data related to a second product comprising the type;

4 data related to characteristics of the second product; and

5 aggregate economic activity data relating to the second product.

1 19. The method of claim 18 wherein training the model includes:

2 adding to the learning data set additional data related to characteristics of the

3 second product; and

4 retraining the modeling system with the learning data set.

1 20. The method of claim 16 further comprising training the modeling 

2 system with a learning data set, the learning data set including: 

3 on-line interest data related to a second product comprising the type; and 

4 aggregate economic activity data relating to the second product. 

1 21. The method of claim 20 wherein training the model includes:

2 adding to the learning data set additional on-line interest data related to the 

3 second product; and 

4 retraining the modeling system with the learning data set. 

1 22. The method of claim 16 wherein the on-line interest data related to the 

2 first product includes counts of page hits of a web page related to the first product.

1 23. The method of claim 16 wherein the on-line interest data related to the

2 first product includes counts of search queries at a web site that include a phrase related to the 

3 first product. 
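
A minimal sketch of the kind of count recited in claim 23, assuming the web site's search log is available as a plain list of query strings. The function name, the case-insensitive substring match, and the sample queries are illustrative assumptions, not details taken from the claims.

```python
def count_queries_with_phrase(queries, phrase):
    """Count logged queries that contain the phrase.

    Matching ignores case and repeated whitespace. It is a plain substring
    test, so it can also match inside longer words.
    """
    needle = " ".join(phrase.lower().split())
    return sum(1 for q in queries if needle in " ".join(q.lower().split()))


# Hypothetical search log for a hypothetical product name.
search_log = [
    "Example Movie showtimes",
    "weather tomorrow",
    "tickets for example   movie",
]
hits = count_queries_with_phrase(search_log, "example movie")
print(hits)
```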

1 24. The method of claim 16 wherein the on-line interest data related to the

2 first product includes an on-line interest measurement provided by a web site. 

1 25. The method of claim 24 wherein the on-line interest measurement

2 provided by a web site is a fictional stock price of the first product. 

1 26. The method of claim 24 wherein the on-line interest measurement 
2 provided by a web site is a percentage of users of the web site initiating searches related to

3 the first product.

1 27. The method of claim 16 wherein the on-line interest data related to the

2 first product includes aggregate Internet usage data related to the first product.

1 28. The method of claim 27 wherein the aggregate Internet usage data

2 related to the first product includes statistics based on analyses of online events related to the

3 first product.

1 29. The method of claim 28 wherein online events include a result of a

2 client making a request of a server and the server providing a response to the client.

1 30. The method of claim 28 wherein the analyses of online events 



2 includes: 

3 automatically associating each online event with one or more subjects; 

4 accumulating counts for events by subject; and 

5 outputting the accumulated counts for each subject. 
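
The three steps of claim 30 can be sketched as follows, assuming each online event is represented by a line of text and that subjects are recognized by keyword lookup. The keyword table, the function names, and the sample events are hypothetical; the claim does not say how events are associated with subjects.

```python
from collections import Counter

# Hypothetical table of subjects and the terms that signal them.
SUBJECT_TERMS = {
    "Example Movie": ("example movie",),
    "Sample Phone": ("sample phone", "samplephone"),
}


def subjects_for_event(event_text):
    """Associate one event with zero or more subjects by keyword match."""
    text = event_text.lower()
    return [s for s, terms in SUBJECT_TERMS.items() if any(t in text for t in terms)]


def accumulate_counts(events):
    """Accumulate one count per (event, subject) association."""
    counts = Counter()
    for event_text in events:
        counts.update(subjects_for_event(event_text))
    return counts


# Hypothetical event log.
events = [
    "search: example movie trailer",
    "page view: Sample Phone review",
    "search: example movie vs sample phone",
    "page view: weather",
]
counts = accumulate_counts(events)
for subject, n in sorted(counts.items()):
    print(subject, n)  # output the accumulated count for each subject
```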

1 31. The method of claim 30 wherein the analyses of online events further

2 includes: 

3 identifying one or more categories relevant to each subject; 

4 accumulating counts for events by category; and 

5 outputting the accumulated counts for each category.

1 32. The method of claim 30 wherein the analyses of online events further

2 includes determining if a subject for an event is a canonical equivalent of another subject; and 

3 wherein counts for canonical equivalents are accumulated together. 
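
One possible reading of claim 32, sketched under the assumption that canonical equivalence is decided by an alias table. The table contents and function names are hypothetical.

```python
from collections import Counter

# Hypothetical alias table: lower-cased variant name -> canonical subject name.
CANONICAL = {
    "example movie 2": "Example Movie II",
    "example movie ii": "Example Movie II",
}


def canonical_subject(subject):
    """Return the canonical name for a subject, or the subject itself."""
    cleaned = subject.strip()
    return CANONICAL.get(cleaned.lower(), cleaned)


def merge_canonical(counts):
    """Accumulate the counts of canonical equivalents together."""
    merged = Counter()
    for subject, n in counts.items():
        merged[canonical_subject(subject)] += n
    return merged


merged = merge_canonical({"Example Movie 2": 3, "example movie II": 4, "Other": 1})
print(dict(merged))
```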

1 33. The method of claim 30 wherein the analyses of online events further 

2 includes normalizing counts for events over a field of events, and wherein outputting the 

3 accumulated counts includes outputting the normalized counts. 
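
Claim 33 does not say how counts are normalized. The sketch below assumes the simplest interpretation, dividing each subject's count by the total number of events in the field, so that counts from periods with different traffic levels become comparable. The function name and figures are illustrative.

```python
def normalize_counts(counts, field_total):
    """Express each subject's count as a fraction of all events in the field."""
    if field_total <= 0:
        raise ValueError("field_total must be positive")
    return {subject: n / field_total for subject, n in counts.items()}


# Hypothetical counts drawn from a field of 100 events.
normalized = normalize_counts({"Example Movie": 25, "Sample Phone": 50}, 100)
print(normalized)
```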

1 34. The method of claim 30 wherein the analyses of online events further

2 includes:

3 determining a set of one or more demographic parameters relating to users that

4 prompt the events; and

5 using the set of one or more demographic parameters to partition the counts by

6 demographic divisions.
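
A sketch of the partitioning in claim 34, assuming each event carries a user identifier and that demographic parameters are looked up per user. The record layout, the `age_band` parameter, and the `"unknown"` fallback division are assumptions made for illustration.

```python
from collections import Counter, defaultdict


def partition_counts(events, user_demographics, parameter):
    """Partition per-subject counts by one demographic parameter.

    events: iterable of (user_id, subject) pairs.
    user_demographics: mapping of user_id -> dict of demographic parameters.
    Users with no value for the parameter fall into the "unknown" division.
    """
    partitions = defaultdict(Counter)
    for user_id, subject in events:
        division = user_demographics.get(user_id, {}).get(parameter, "unknown")
        partitions[division][subject] += 1
    return partitions


# Hypothetical events and user profiles.
events = [
    ("u1", "Example Movie"),
    ("u2", "Example Movie"),
    ("u3", "Example Movie"),
    ("u1", "Sample Phone"),
]
profiles = {"u1": {"age_band": "18-24"}, "u2": {"age_band": "25-34"}}

by_age = partition_counts(events, profiles, "age_band")
for division in sorted(by_age):
    print(division, dict(by_age[division]))
```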

1 35. The method of claim 16 wherein the first product is selected from the

2 group consisting of a movie, a video tape, a CD, a DVD, a model of automobile, a book, a

3 toy, an appliance, an electronic device, a pharmaceutical product, and a software product.

1 36. The method of claim 16 wherein the predicted measure of aggregate

2 economic activity is a predicted number of sales during a period of time. 

1 37. The method of claim 16 wherein the predicted measure of aggregate 

2 economic activity is a predicted monetary value of sales during a period of time. 

1 38. A system for predicting a measure of aggregate economic activity 

2 related to a product, the system comprising: 

3 a modeling system configured to model aggregate economic activity of a type 

4 of product as a function of aggregate on-line interest data related to products comprising the 

5 type, wherein the on-line interest data is based on passive observation of on-line behavior of a 

6 subpopulation, wherein the on-line behavior is related to, but different than, the economic 

7 activity to be modeled, and wherein the subpopulation comprises a subset of a population that 

8 engages in the economic activity to be modeled; and 

9 a module for receiving on-line interest data related to a first product 
10 comprising the type and providing the on-line interest data to the modeling system;

11 wherein the modeling system generates a predicted measure of economic

12 activity related to the first product using the on-line interest data. 

1 39. The system of claim 38 wherein the modeling system is further 

2 configured to model aggregate economic activity of the type of product as a function of 

3 characteristics of products comprising the type, the system further including a module for

4 receiving data related to characteristics of the first product and providing the data related to

5 characteristics of the first product to the modeling system.

1 40. The system of claim 39 further including a training module that trains

2 the modeling system with a learning data set, wherein the learning data set includes:

3 on-line interest data related to a second product comprising the type;

4 data related to characteristics of the second product; and

5 aggregate economic activity data related to the second product.

1 41. The system of claim 38 further including a training module that trains

2 the modeling system with a learning data set, wherein the learning data set includes:

3 on-line interest data related to a second product comprising the type; and

4 aggregate economic activity data related to the second product.

1 42. The system of claim 38 further comprising an aggregate Internet usage

2 statistics generator that provides aggregate Internet usage statistics related to the first product

3 to the module for receiving on-line interest data.

1 43. The system of claim 42 wherein the aggregate Internet usage statistics

2 generator includes:

3 an activity input for receiving data related to events on a set of servers; 

4 means for categorizing events into categories; 

5 means for associating events with subjects, wherein counts are maintained for 

6 each subject and wherein subjects are associated with categories; 

7 a normalizer for normalizing counts for events over a field of events; and 

8 a result output for outputting results of the normalizer as the online usage 

9 statistics. 
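
To illustrate how subjects "associated with categories" might feed category-level statistics in a generator of this kind, the sketch below rolls per-subject counts up into per-category counts. The subject-to-category table, the `"uncategorized"` fallback, and the function name are hypothetical and stand in for whatever categorizing means an actual system would use.

```python
from collections import Counter

# Hypothetical association of subjects with one or more categories.
SUBJECT_CATEGORIES = {
    "Example Movie": ("movies",),
    "Sample Phone": ("electronics", "phones"),
}


def counts_by_category(subject_counts):
    """Add each subject's count to every category the subject belongs to."""
    totals = Counter()
    for subject, n in subject_counts.items():
        for category in SUBJECT_CATEGORIES.get(subject, ("uncategorized",)):
            totals[category] += n
    return totals


category_totals = counts_by_category(
    {"Example Movie": 4, "Sample Phone": 3, "Unlisted Item": 1}
)
print(dict(category_totals))
```

A subject that belongs to several categories contributes its full count to each, so category totals can sum to more than the number of events.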

1 44. A method of training a modeling system to predict aggregate economic

2 activity related to a product comprising a type of products, the method comprising: 



3 providing a modeling system; 

4 providing a learning data set including: 

5 an actual measure of aggregate economic activity related to a first 

6 product comprising the type; and 

7 aggregate on-line interest data related to the first product, the on-line

8 interest data based on passive observation of on-line behavior 

9 of a subpopulation, wherein the on-line behavior is related to, 

10 but different than, the actual economic activity, and wherein the 

11 subpopulation comprises a subset of a population that engages

12 in the economic activity;

13 training the modeling system with the learning data set to minimize the error

14 between a predicted measure of aggregate economic activity related to the first product

15 generated by the modeling system and the actual measure of aggregate economic activity

16 related to the first product.

1 45. The method of claim 44 wherein the learning data set further includes:

2 an actual measure of aggregate economic activity related to a second product

3 comprising the type;

4 aggregate on-line interest data related to the second product, the on-line

5 interest data based on passive observation of on-line behavior of the subpopulation, wherein

6 the on-line behavior is related to, but different than, the actual economic activity; 

7 wherein training the modeling system with the learning data set includes 

8 minimizing the mean-square error between the predicted measure of aggregate economic 

9 activity related to the first product generated by the modeling system and the actual measure 

10 of aggregate economic activity related to the first product and between the predicted measure 

11 of aggregate economic activity related to the second product generated by the modeling 

12 system and the actual measure of aggregate economic activity related to the second product.